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Abstract 



We investigate the asymptotic behavior of the steady-state queue length distribution under general- 
ized max-weight scheduling in the presence of heavy-tailed traffic. We consider a system consisting of 



other receives light-tailed traffic. We study the class of throughput optimal max-weight-a scheduling 
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OQ ' two parallel queues, served by a single server One of the queues receives heavy-tailed traffic, and the 

^D [ policies, and derive an exact asymptotic characterization of the steady-state queue length distributions. 

In particular, we show that the tail of the light queue distribution is heavier than a power-law curve, 

whose tail coefficient we obtain explicitly. Our asymptotic characterization also contains an intuitively 
^% ' surprising result - the celebrated max-weight scheduling policy leads to the worst possible tail of the 

y^ , light queue distribution, among all non-idling policies. 

Motivated by the above 'negative' result regarding the max-weight-a policy, we analyze a log- 

max-weight (LMW) scheduling policy. We show that the LMW policy guarantees an exponentially 

decaying light queue tail, while still being throughput optimal. 



I. Introduction 

Traditionally, traffic in telecommunication networks has been modeled using Poisson and Markov- 
modulated processes. These simple traffic models exhibit 'local randomness', in the sense that much 
of the variability occurs in short time scales, and only an average behavior is perceived at longer time 
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scales. With the spectacular growth of packet-switched networks such as the internet during the last 
couple of decades, these traditional traffic models have been shown to be inadequate. This is because 
the traffic in packetized data networks is intrinsically more 'bursty', and exhibits correlations over 
longer time scales than can be modeled by any Markovian point process. Empirical evidence, such as 
the famous Bellcore study on self-similarity and long-range dependence in ethernet traffic |[T3l lead to 
increased interest in traffic models with high variability. 

Heavy-tailed distributions, which have long been used to model high variability and risk in finance and 
insurance, were considered as viable candidates to model traffic in data networks. Further, theoretical 
work such as |12|, linking heavy-tails to long-range dependence (LRD) lent weight to the belief that 
extreme variability in the internet file sizes is ultimately responsible for the LRD traffic patterns reported 
in |[T3l and elsewhere. 

Many of the early queueing theoretic results for heavy-tailed traffic were obtained for the single 
server queue; see H, Q, ifTSl for surveys of these results. It turns out that the service discipline plays 
an important role in the latency experienced in a queue, when the traffic is heavy-tailed. For example, 
it was shown in |[1] that any non-preemptive service discipline leads to infinite expected delay, when 
the traffic is sufficiently heavy-tailed. Further, the asymptotic behavior of latency under various service 
disciplines such as first-come-first-served (FCFS) and processor sharing (PS), is markedly different 
under light-tailed and heavy-tailed scenarios lH, ||23l . This is important, for example, in the context of 
scheduling jobs in server farms lUTI . 

In the context of communication networks, a subset of the traffic flows may be well modeled as 
heavy-tailed, and the rest better modeled as light-tailed. In such a scenario, there are relatively few 
studies on the problem of scheduling between the different flows, and the ensuing nature of interaction 
between the heavy-tailed and light-tailed traffic. Perhaps the earliest, and one of the most important 
studies in this category is |3|, where the interaction between light and heavy-tailed traffic flows under 
generalized processor sharing (GPS) is studied. In that paper, the authors derive the asymptotic workload 
behavior of the light-tailed flow, when its GPS weight is greater than its traffic intensity. 

One of the key considerations in the design of a scheduling policy for a queueing network is 
throughput optimality, which is the ability to support the largest set of traffic rates that is supportable 
by a given queueing network. Queue length based scheduling policies, such as max-weight scheduling 
II2TI . II22I and its many variants, are known to be throughput optimal in a general queueing network. 
For this reason, the max-weight family of scheduling policies has received much attention in various 
networking contexts, including switches |[T5]| . satellites lfT6]| . wireless ifTTll . and optical networks ||6]|. 



In spite of a large and varied body of literature related to max-weight scheduling, it is somewhat sur- 
prising that the policy has not been adequately studied in the context of heavy-tailed traffic. Specifically, 
a question arises as to what behavior we can expect due to the interaction of heavy and light-tailed 
flows, when a throughput optimal max-weight-like scheduling policy is employed. Our present work is 
aimed at addressing this basic question. 

In a recent paper (Ml, a special case of the problem considered here is studied. Specifically, it was 
shown that when the heavy-tailed traffic has an infinite variance, the light-tailed traffic experiences an 
infinite expected delay under max-weight scheduling. Further, it was shown that the max-weight policy 
can be tweaked to favor the light-tailed traffic, so as to make the expected delay of the light-tailed traffic 
finite. In the present paper, we considerably generalize these results by providing a precise asymptotic 
characterization of the occupancy distributions under the max-weight scheduling family, for a large 
class of heavy-tailed traffic distributions. 

We study a system consisting of two parallel queues, served by a single server. One of the queues is 
fed by a heavy-tailed arrival process, while the other is fed by light-tailed traffic. We refer to these queues 
as the 'heavy' and 'Ught' queues, respectively. In this setting, we analyze the asymptotic performance of 
max-weight-a scheduling, which is a generalized version of max-weight scheduling. Specifically, while 
max-weight scheduling makes scheduling decisions by comparing the queue lengths in the system, the 
max-weight-a policy uses different powers of the queue lengths to make scheduling decisions. Under 
this policy, we derive an exact asymptotic characterization of the light queue occupancy distribution, 
and specify all the bounded moments of the queue lengths. 

A surprising outcome of our asymptotic characterization is that the 'plain' max-weight scheduling 
policy induces the worst possible asymptotic behavior on the light queue tail. We also show that by 
a choice of parameters in the max-weight-a policy that increases the preference afforded to the light 
queue, the tail behavior of the light queue can be improved. Ultimately however, the tail of the light 
queue distribution is lower bounded by a power-law-like curve, for any scheduling parameters used in 
the max-weight-a scheduling policy. Intuitively, the reason max-weight-a scheduling induces a power- 
law-like decay on the light queue distribution is that the light queue has to compete with a typically 
large heavy queue for service. 

The simplest way to guarantee a good asymptotic behavior for the light queue distribution is to give 
the light queue complete priority over the heavy queue, so that it does not have to compete with the 
heavy queue for service. We show that under priority for the light queue, the tail distributions of both 
queues are asymptotically as good as they can possibly be under any poUcy. Be that as it may, giving 
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Fig. 1. A system of two parallel queues, with one of them fed with heavy-tailed traffic. 

priority to the light queue has an important shortcoming - it is not throughput optimal for a general 
constrained queueing system. 

We therefore find ourselves in a situation where on the one hand, the throughput optimal max-weight- 
a scheduling leads to poor asymptotic performance for the light queue. On the other hand, giving 
priority to the light queue leads to good asymptotic behavior for both queues, but is not throughput 
optimal in general. To remedy this situation, we propose a throughput optimal log-max-weight (LMW) 
scheduling policy, which gives significantly more importance to the light queue compared to max- 
weight-a scheduling. We analyze the asymptotic behavior of the LMW policy and show that the light 
queue occupancy distribution decays exponentially. We also obtain the exact large deviation exponent 
of the light queue tail under a regularity assumption on the heavy-tailed input. Thus, the LMW policy 
has both desirable attributes - it is throughput optimal, and ensures an exponentially decaying tail for 
the light queue distribution. 

The remainder of this paper is organized as follows. In Section |lll we describe the system model. 
In Section |llll we present the relevant definitions and mathematical preliminaries. Section |IV] deals 
with the queue length behavior under priority scheduling. Sections IVl and IVlIl respectively contain our 
asymptotic results for max-weight-a scheduling, and the LMW policy. We conclude the paper in Section 

lYml 

II. System Model 

Our system consists of two parallel queues, H and L, served by a single server, as depicted in Fig. [T] 
Time is slotted, and stochastic arrivals of packet bursts occur to each queue in each slot. The server 
is capable of serving one packet per time slot from only one of the queues according to a scheduling 
policy. Let Hit) and L(t) denote the number of packets that arrive during slot ttoH and L respectively. 
Although we postpone the precise assumptions on the traffic to Section IIII-BI let us loosely say that 
the input L{t) is light-tailed, and H{t) is heavy-tailed. We will refer to the queues H and L as the 
heavy and Ught queues, respectively. The queues are assumed to be always connected to the server. Let 



qnit) and qiit), respectively, denote the number of packets in H and L during slot t, and let qu and 
qi denote the steady-state queue lengths, when they exist. Our aim is to characterize the behavior of 
IP {ql > b} and P {q^ > 6} as 6 becomes large, under various scheduling policies. 

III. Definitions and Mathematical Preliminaries 
A. Heavy-tailed distributions 

We begin by defining some properties of tail distributions of non-negative random variables. 

Definition 1: A random variable X is said to be light-tailed if there exists 6 > ior which 
E [exp(0X)] < oo. A random variable is heavy-tailed if it is not light-tailed. 

In other words, a light-tailed random variable is one that has a well defined moment generating 
function in a neighborhood of the origin. The complementary distribution function of a light-tailed 
random variable decays at least exponentially fast. Heavy-tailed random variables are those which have 
complementary distribution functions that decay slower than any exponential. This class is often too 
general to study, so sub-classes of heavy-tailed distributions, such as sub-exponentials have been defined 
and studied in the past ||20l . We now review some definitions and existing results on some relevant 
classes of heavy-tailed distributions. In the remainder of this section, X will denote a non-negative 
random variable, with complementary distribution function F{x) = F {X > x} . For the most part, we 
adhere to the terminology in 121, 171 . 

Notation: If f{x) and g{x) are positive functions defined on [0, cxo], we write f{x) ~ g{x) to mean 



Similarly, f{x) > g{x) means 

lini inf 

x-j-oo g[x 



liniinf^>l. 



Definition 2: 1) F{x) is said to have a regularly varying tail of index v, notation F E 7^(z^), if 

lim SM = k'-^ yk>o. 

2) F{x) is said to be extended-regularly varying, notation F G SIZ, if for some real c,d > 0, and 

k'^ < liminf S^ < limsupS^ < k", Vk € [l,r]. 

x-)-oo F(^x) x-^oo F{x) 



3) F{x) is said to be intermediate-regularly varying, notation F G ITZ, if 

lim lini inf -^= = lim lim sup ■ — '' 



fc^l x^oo F(^x) HI x^oo F{x) 

4) F{x) is said to be order-regularly varying, notation F € OTZ, if for some F > 1, 

'Fikx) Tikx) 

< liminf ^^ — - < limsup^^ — - < oo, V/c € [l,r]. 

x^co p(^x) x^oo F{x) 

It is easy to see from the definitions that TZ C £TZ C TTZ C OTZ. In fact, the containments are 
proper, as shown in Q. Intuitively, IZ is the class of distributions with tails that decay according to a 
power-law with parameter v. Indeed, it can be shown that 

F G 7^ ^^ F{x) = U{x)x-'', 

where U{x) is a slowly varying function, i.e, a function that satisfies U{kx) ~ U{x), V/c > 0. The other 
three classes are increasingly more general, but as we shall see, they all correspond to distributions that 
are asymptotically heavier than some power-law curve. In what follows, a statement such as X G ZTZ 
should be construed to mean P {X > x} G ZTZ. 

Next, we define the lower and upper orders of a distribution. 

Definition 3: 1) The lower order of F{x) is defined by 

^(F)=liminf-i^|^. 

a;-)-oo log X 

2) The upper order of F{x) is defined by 

logF(x) 



p{F) = lim sup 



logx 

It can be shown that for regularly varying distributions, the upper and lower orders coincide with the 
index u. It also turns out that both the orders are finite for the class OTZ, as asserted below. 

Proposition 1: p{F) < oo for every F G OTZ. 

Proof: Follows from Theorem 2.1.7 & Proposition 2.2.5 in 0- D 

The following result, which is a consequence of Proposition [H shows that every F G OTZ is 
asymptotically heavier than a power-law curve. 

Proposition 2: Let F G OTZ. Then, for each p > p{F), we have x^^ = o{F{x)) as x — )• oo. 
Proof: See Equation (2.4) in |[T9l . 

Definitions [2] and [3] deal with asymptotic tail probabilities of a random variable. Next, we introduce 
the notion of tail coefficient, which is a moment property. 



Definition 4: The tail coefficient of a random variable X is defined by 

Cx = sup{c>0 I E[X^] <oo}. 

In other words, the tail coefficient is the threshold where the power moment of a random variable starts 
to blow up. Note that the tail coefficient of a light-tailed random variable is infinite. On the other hand, 
the tail coefficient of a heavy-tailed random variable may be infinite (e.g., log-normal) or finite (e.g., 
Pareto). The next result shows that the tail coefficient and order are, in fact, closely related parameters. 

Proposition 3: \j The tail coefficient of X is equal to the lower order of F{x). 
Proof: Suppose first that the lower order is infinite, so that for any s > 0, we can find an x large 

enough such that 

log FIX > x} 

i > ^• 

logx 

Thus, for large enough x, we have 

F{X > x} <x"', V s >0. 

This implies E [X'^] < oo for all c > 0. Therefore, the tail coefficient of X is also infinite. 

Next suppose that ^(F) G (0, oo). We will show that (i) E [X'^] < oo for all c < ^(F), and 
(ii)E [X'^] = oo for all c > ^(F). To show (i), we argue as above that for large enough x, we have 
¥{X > x} < x"'*, when s < S,(F). Thus, E [X'^] < oo for all c < S,(F). To show (ii), let us consider 
some s such that c > s > ^(-F). By the definition of ^{F) there exists a sequence {xj} that increases 
to infinity as i — )■ oo, such that 

logF{X> Xi} 

logXi 

Therefore, 



< s, V i ^^ F{X > Xi} > x"', V i. 



/■OO /"OO 

E [X"] = I x^dFx{x) > / x^dFxix) > xlF{X > x,} > x'rx- 

Jo Jxi 



Vi, 



from which it follows that E [X'^] = oo. Therefore, the tail coefficient of X is equal to £,{F). □ 

We emphasize that Proposition [3] holds for any random variable, regardless of its regularity properties. 

Finally, we show that any distribution in OTZ necessarily has a finite tail coefficient. 
Proposition 4: If X € OTZ, then X has a finite tail coefficient. 
Proof: From Proposition [T] the upper order is finite: p{F) < oo. Thus, the lower order C{F) is also 

finite. But Proposition |3] asserts that lower order is equal to the tail coefficient. D 

'The first author is grateful to Jayakrishnan Nair (Caltech) for suggesting a proof of Proposition |3] via a personal 
communication. 



B. Assumptions on the arrival processes 

We are now ready to state the precise assumptions on the arrivals processes. 

1) The arrival processes H{t) and L{t) are independent of each other, and independent of the current 
state of the system. 

2) H{t) is independent and identically distributed (i.i.d.) from slot-to-slot. 

3) L{t) is i.i.d. from slot-to-slot. 

4) L(-) is light-tailed with E [L{t)] = Xl- 

5) H{-) G on with tail coefficient Cr > 1, and E [H{t)] = Xh- 

We also assume that Xl + Xh < 1, so that the input rate does not overwhelm the service rate. Then, 
it can be shown that the system is stablen under any non-idling policy, and that the steady-state queue 
lengths qh and qi exist. 

C. Residual and Age distributions 

Here, we define the residual and age distributions for the heavy-tailed input process, which will be 

useful later. First, we note that H{-) necessarily has a non zero probability mass at zero, since Xh < 1- 

Define H-^- as the strictly positive part oi H{-). Specifically, 

FiH(-)=m} 

F\H+ = m} = ^ ; \ , ^—, m = l,2,.... 

^ + ' 1-F{H{-) = 0}' 

Note that H^ has tail coefficient equal to Ch, and inherits any regularity property of H{-). 

Now consider a discrete-time renewal process with inter-renewal times distributed as H-^- . Let Hr € 

{1,2,...} denote the residual random variable, and Ha G {0, 1, ... } the age of the renewal process 

||9lo The joint distribution of the residual and the age can be derived using basic renewal theory: 

P{Hn = k,HA = l} = ^^^^^^^^^, ke{l,2...}, /€{0,1,...}. (1) 

The marginals of Hr and Ha can be derived from ([T]): 

P{Hn = k} = ^^§±^,kG{l,2,...}. (2) 

^The notion of stability used here is the positive recurrence of the system occupancy Markov chain. 

^We have defined the residual time and age such that if a renewal occurs at a particular time slot, the age at that time slot 
is zero, and the residual time is equal to the length of the upcoming renewal interval. 



F{H^ = k} = ^i§±^,ke{0,l,...}. (3) 

Next, let us invoke a useful result from the literature. 

Lemma 1: If H{-) € OTZ, then Hr G STZ, and 

nF{H+>n} 
sup mjrr ^ . <oo. (4) 

A corresponding result also holds for the age Ha- 

Proof: See Ui Lemma 4.2(i)]. D 

Using the above, we prove the important result that the residual distribution is one order heavier 
than the original distribution. 

Proposition 5: If H{-) € OTZ has tail coefficient equal to Ch, then Hr and Ha have tail coefficient 
equal to Ch — 1- 
Proof: According to (|4]|, we have, for all a and some real x, 

- log P {Hr > a} < - log a - log P {H+ > a} + x- 

Let us now consider the lower order of Hr : 

a->oo log a a— >-oo log a 

In the last step above, we have used the tail coefficient of H-^-. Since the lower order of Hr equals its 
tail coefficient (Lemma |3]l, the above relation shows that the tail coefficient of Hr is at most Ch — 1- 
Next, to show the opposite inequality, let us consider the duration random variable, defined as 

Hd = Hr + Ha. 

Using the joint distribution ([T]), we can obtain the marginal of H^ as 

Thus, for any e > 0, the Ch — 1 — e moment of Hu is finite: 



E 



■c.-i-^ _ sr^ kC--^-^k¥{H+ = k} _ ^ 

^ \~ ^ E \H^] 



fc>i ' " ^ ^ 



< oo. 



Since Hr is stochastically dominated by Hd, it is immediate that E 






< OO. Therefore, the 



tail coefficient of Hr is at least Ch — 1, and the proposition is proved. D 
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IV. The Priority Policies 

In this section, we study the two 'extreme' scheduling policies, namely priority for L and priority 
for H. Our analysis helps us arrive at the important conclusion that the tail of the heavy queue is 
asymptotically insensitive to the scheduling policy. In other words, there is not much we can do to 
improve or hurt the tail distribution of H by the choice of a scheduling policy. Further, we show that 
giving priority to the light queue ensures the best possible asymptotic decay for both the queue length 
distributions. 

A. Priority for H 

In this policy, H receives service whenever it is non-empty, and L receives service only when H is 
empty. It should be intuitively clear at the outset that this policy is bound to have undesirable impact 
on the light queue. The reason we analyze this policy is that it gives us a best case scenario for the 
heavy queue. 

Our first result shows that the steady-state heavy queue occupancy is one order heavier than its input 
distribution. 

Theorem 1: Under priority scheduling for H, the steady-state queue occupancy distribution of the 
heavy queue satisfies the following bounds. 

1) For every e > 0, there exists a Ki/(e) such that 

^{qh >h} < KH{e)b-^^"~^~'\ V b. (5) 

2) 

^{qn >h}> \h^{Hr > 6} , V 6. (6) 

Further, qh is a heavy-tailed random variable with tail coefficient equal to Ch — 1- That is, for each 
e > 0, we have 

' '^ ^ ' < oo, (7) 



E 

and 

E 



<&''-' 



€r-'^ 



oo. (8) 



Proof: Equation (|7]) can be shown using a straightforward Lyapunov argument, along the lines of lfT4l 
Proposition 6]. Equation ([5]) follows from ([7]l and the Markov inequality. 
Next, to show ([S]), we consider a time instant t at steady-state, and write 

F{g//(i) > 5} =P{gH(i) > h\qH{t) > 0}F{qH{t) > 0} = XHP{qH{t) > b\qH{t) > 0} . 
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We have used Little's law at steady-state to write P{g//(t) > 0} = A^. Let us now lower bound 
the term P{g//(t) > b\qH{t) > 0} . Conditioned on H being non-empty, denote by B{t) the number of 
packets that belong to the burst in service that still remain in the queue at time t. Then, clearly, qnit) > 
B{t), from which ¥ {qnit) > b\qH{t) > 0} > P < B{t) > b> . Now, since the H queue receives service 
whenever it is non-empty, it is clear that the time spent at the head-of-line by a burst is equal to its 
size. It can therefore be shown that in steady-state, B{t) is distributed according to the residual variable 
Hr. Thus,F{qH{t) > b\qHit) > 0} >F{Hr > b} , and ^ follows. Finally, §^ follows from © and 
Proposition [5] □ 

When the distribution of H{-) is regularly varying, the lower bound ^ takes on a power-law form 
that agrees with the upper bound ([5]). 

Corollary 1: If H{-) e n{CH), then 

F{qH >b}> U{b)b-^^"-^\ V 6, 

where U{-) is some slowly varying function. 

Since priority for H affords the most favorable treatment to the heavy queue, it follows that the 

asymptotic behavior of H can be no better than the above under any policy. 

Proposition 6: Under any scheduling policy, qu is heavy-tailed with tail coefficient at most Ch — 1- 
That is. Equation ([8]l holds for all scheduling policies. 

Proof: The tail probability P {qn > b} under any other policy stochastically dominates the tail under 
priority for H. Therefore, the lower bounds Q and ([8]l would hold for all policies. □ 

Interestingly, under priority for H, the steady-state light queue occupancy qL is also heavy-tailed 
with the same tail coefficient as qn- This should not be surprising, since the light queue has to wait 
for the entire heavy queue to clear, before it receives any service. 

Theorem 2: Under priority for H, qi is heavy-tailed with tail coefficient Ch — 1- Furthermore, the 
tail distribution P {q^ > b} satisfies the following asymptotic bounds. 

1) For every e > 0, there exists a ^^(e) such that 

F{qL>b}<HL{e)b-^^"-^-'\ (9) 

2) If H[-) G on, then 

F{qL>b}>\HFlHA>Y\ (10) 

Proof: The upper bound (|9ll is a special case of Theorem |4] given in the next section. Let us show 
(ITOl) . Notice first that the lower bound (ITOl) is asymptotic, unlike ^ which is exact. As before, let us 
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consider a time t at steady-state, and write using Little's law 

P {QLit) >b}>F {QLit) > b\qHit) > 0} P {qnit) > 0} = A^P {qiit) > blgnit) > 0} . 

Let us denote by A{t) the number of slots that the current head-of-Une burst has been in service. Clearly 
then, L has not received any service in the interval [t — A{t),t\, and has kept all the arrivals that occurred 
during the interval. Thus, conditioned on H being non-empty, qL{t) > '^a=t-A(t) ^('^)' Next, it can 
be seen that in steady-state, A{t) is distributed as the age variable Ha- Putting it all together, we can 
write 

P{9L >b}> XH^lqiit) > b\qH{t) > 0} > Aj^pj J^L(i) > b\ . (11) 

Next, since H{-) G OTZ, Lemma [T] implies that Ha G £Tl C X7^. We can therefore invoke Lemma |4] 
in the appendix to write 

p| J^L«>6l~p|i7A>^|. (12) 

Finally, ^ follows from (dB and (O. D 

We note that if H{-) is regularly varying, the lower bound ( fTOl ) takes on a power-law form that 
matches the upper bound ^. 

B. Priority for L 

We now study the policy that serves L whenever it is non-empty, and serves H only if L is empty. 
This policy affords the best possible treatment to L and the worst possible treatment to H, among all 
non-idling policies. Under this policy, L is completely oblivious to the presence of H, in the sense 
that it receives service whenever it has a packet to be served. Therefore, L behaves like a discrete time 
G/D/1 queue, with light-tailed inputs. Classical large deviation bounds can be derived for such a queue; 
see mOI for example. 

Recall that since L(-) is light-tailed, the log moment generating function 

AL(^) = logEre^-^(-) 

exists for some 9 > 0. Define 

EL = sup{e\AL{9)-9<0}. (13) 

Proposition 7: Under priority for L, qi satisfies the large deviation principle (LDP) 

\im-l\ogF{qL>b} = EL (14) 
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In words, the above proposition asserts that the tail of q^ is asymptotically exponential, with rate 
function E^. We will refer to El as the intrinsic exponent of the light queue. An equivalent expression 
for the intrinsic exponent that is often used in the literature is 

El = mf -Alii + a), (15) 

a>0 a 
where AJ^(-) is the Fenchel-Legendre transform lITOl of Al{6). 

It is clear that the priority policy for L gives the best possible asymptotic behavior for the light 
queue, and the worst possible treatment for the heavy queue. Surprisingly however, it turns out that the 
heavy queue tail under priority for L is asymptotically as good as it is under priority for H. 

Proposition 8: Under priority for L, qu is heavy-tailed with tail coefficient Ch — 1- 
Proof: This is a special case of Theorem |4l given in the next section. □ 

The above result also implies that the tail coefficient of H cannot be worse than Ch — 1 under any 
other scheduling policy. 

Proposition 9: Under any non-idling scheduling policy, qn has a tail coefficient of at least Ch — 1 • 
That is. Equation ([7]) holds for all non-idling scheduling policies. 

Proof: The tail probability P {qn > b} under any other policy is stochastically dominated by the tail 
probability under priority for L. D 

Propositions [6] and |9] together imply the insensitivity of the heavy queue's tail distribution to the 
scheduling policy. We state this important result in the following theorem. 

Theorem 3: Under any non-idling scheduling policy, qn is heavy-tailed with tail coefficient equal to 
Ch — 1- Further, F {qn > b} satisfies bounds of the form ^ and Q under all non-idling policies. 
Therefore, it is not possible to either improve or hurt the heavy queue's asymptotic behavior, by the 
choice of a scheduling policy. 

It is evident that the light queue has the best possible asymptotic behavior under priority for L. 
Although priority for L is non-idling, and therefore throughput-optimal in this simple setting, we are 
ultimately interested in studying more sophisticated network models, where priority for L may not 
be throughput optimal. We therefore analyze the asymptotic behavior of general throughput optimal 
policies belonging to the max-weight family. 

V. Queue Length Asymptotics for MAx-WEiGHT-a Scheduling 

In this section, we analyze the asymptotic tail behavior of the light queue distribution under max- 
weight-Q scheduling. For fixed parameters an > and ul > 0, the max-weight-o policy operates as 
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follows: During each time slot t, peri'orm the comparison 

and serve one packet from the queue that wins the comparison. Ties can be broken arbitrarily, but we 
break them in favor of the light queue for the sake of definiteness. Note that a^ = an corresponds to 
the usual max-weight policy, which serves the longest queue in each slot. ai/aH > 1 corresponds to 
emphasizing the light queue over the heavy queue, and vice-versa. 

We provide an asymptotic characterization of the light queue occupancy distribution under max- 
weight-a scheduling by deriving matching upper and lower bounds. Our characterization shows that the 
light queue occupancy is heavy-tailed under max-weight-a scheduling for all values of the parameters 
an and ul- Since we obtain distributional bounds on the light queue occupancy, our results also shed 
further light on the moment results derived in |[T4l for max-weight-a scheduling. 

A. Upper bound 

In this section, we derive two different upper bounds on the overflow probability P {qi > b} , that 
both hold under max-weight-a scheduling. However, depending on the values of an and ul, one of 
them would be tighter. The first upper bound holds for all non-idling policies, including max weight-a 
scheduling. 

Theorem 4: Under any non-idling policy, and for every e > 0, there exists a constant Ki{e) > 0, 
such that 

' < oo (16) 



E 
and 



^1-'-' 



-(C«-l-e) 



IP{9L >H <Ki(e)&^^ "~ '^ (17) 

Proof: Let us combine the two queues into one, and consider the sum input process H{t) + L{t) feeding 
the composite queue. The server serves one packet from the composite queue in each slot. Under any 
non-idling policy in the original system, the occupancy of the composite queue is given hy q = qu + QL- 
Let us first show that the combined input has tail coefficient equal to Ch- 

Lemma 2: The tail coefficient of i/(-) + L(-) is Ch- 
Proof: Clearly, E [{H + L)'^"+^] > E [H'^"+^] = oo, for every 5 > 0. We next need to show that 
E \{H + L)'-^"^''] < oo, for every 5 > 0. For a random variable X and event E, let us introduce 
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the notation K[X;E] = K[X1e], where 1e is the indicator of E. (Thus, for example, K[X] 

E[X;E]+E[X;E''].) Now, 



E 



[H + L)^" 



= E 


{H + Lf"-^]H-> 


> L 


+ E 


{H + Lf"-^; 


H <L 


< E 


{2Hf''-^-H>L 


+ E 


(2Lf"-^-H<L 




< 2^ 


«-<5|]E 


fjCu-S 


+ E 


'l^"- 


s' 


1 < oo 





where the last inequality follows from the tail coefficient of H{-), and the light-tailed nature of L(-).n 
The composite queue is therefore a G/D/1 queue with input tail coefficient Ch- For such a queue, it 
can be shown that 



E[g^" 



Ch-I- 



< (X). 



(18) 



This is, in fact, a direct consequence of Theorem [T] 

Thus, in terms of the queue lengths in the original system, we have 

E [{qn + qif"-^-'] < oo, 

< oo. This proves (IT6l ). To show dTTl ). we use the Markov 



from which it is immediate that E 
inequality to write 



,f^-- 



'{ql > 6} = p|g£"-i-^ > 6^"-i~^} < 



E 



q^L--'-' 



Wh- 



1-e 



<Ki(e)6-(^«-i-^). 



D 



The above result asserts that the tail coefficient of qi is at least Ch — 1 under any non-idling policy, 
and that ¥ {qi > b] i& uniformly upper bounded by a power-law curve. Our second upper bound is 
specific to max-weight-a scheduling. It hinges on a simple observation regarding the scaling of the a 
parameters, in addition to a theorem in [14|. We first state the following elementary observation due to 
its usefulness. 

Observation: (Scaling of a parameters) Let an and ai be given parameters of a max-weight-a policy, 
and let /3 > be arbitrary. Then, the max-weight-a policy that uses the parameters [iau and fiai for 
the queues H and L respectively, is identical to the original policy. That is, in each time slot, the two 
policies make the same scheduling decision. 

Next, let us invoke an important result from lfT4l . 

Theorem 5: If max-weight-a scheduling is performed with < an < Ch — 1, then, for any a/, > 0, 
we have E [g^"'] < oo. 
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Thus, by choosing a large enough ul, any moment of the hght queue length can be made finite, as 
long as an < Ch — 1- Our second upper bound, which we state next, holds regardless of how the a 
parameters are chosen. 
Theorem 6: Define 

7 = — (Ch-1). 
an 

Under max weight-a scheduling, and for every e > 0, there exists a constant K2(e) > 0, such that 



E 
and 



7— e 



< oo (19) 



F {ql > b} < K2{e)b-'-"'-'\ (20) 

Proof: Given e > 0, let us choose /3 = {CH — ^)/ciH — (-/ciL, and perform max-weight-a scheduling with 
parameters fian and 13a l- According to the above observation, this policy is identical to the original 
max-weight-Q policy. Next, since fiau < Ch — 1, Theorem |5] applies, and we have E g^"^ = 
E g^*^ < oo, which proves ( fT9l ). Finally, ( |20l ) can be proved using ( fT9l ) and the Markov inequality. 
D 

The above theorem asserts that the tail coefficient of q^ is at least 7 under the max weight-a 
policy. We remark that Theorem |4] and Theorem [6] both hold for max-weight-a scheduling with any 
parameters. However, one of them yields a stronger bound than the other, depending on the a parameters. 
Specifically, we have the following two cases: 

(i) -^^ < 1 : This is the regime where the light queue is given lesser priority, when compared to the 

heavy queue. In this case. Theorem |4] yields a stronger bound, 
(ii) -^^ > 1 : This is the regime where the light queue is given more priority compared to the heavy 
queue. In this case. Theorem [6] gives the stronger bound. 
Remark 1: The upper bounds in this section hold whenever H{-) is heavy-tailed with tail coefficient 
Ch- We need the assumption if (•) G OTZ only to derive the lower bounds in the next subsection. 

B. Lower bound 

In this section, we state our main lower bound result, which asymptotically lower bounds the tail of 
the light queue distribution in terms of the tail of the residual variable Hr. 
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Theorem 7: Let H{-) G OTZ. Then, under max-weight-a scheduling with parameters uh and ai, 
the distribution of the Ught queue occupancy satisfies the following asymptotic lower bounds: 
1) If ^ < 1, 

^{qL>b]>XH^\HR>^\ (21) 

'{qL>h}>\H^[HR>b(l + ^\\ {11) 

^Wl >h}> AhP [Hr > 6"-/--} . (23) 

As a special case of the above theorem, when H{-) is regularly varying with index Ch, the lower 
bounds take on a more pleasing power-law form that matches the upper bounds (fTTl) and (l20l) . 

Corollary 2: Suppose H{-) G 1Z{Ch)- Then, under max-weight-a scheduling with parameters uh 
and a^, the distribution of the light queue satisfies the following asymptotic lower bounds: 
^ < 1, 

'{qL>h]>U{h)h-^^--^) (24) 



2) If ^ = 1, 

' Oh ' 



3) If ^ > 1, 



1) If ^ < 1, 

^ Off — ' 



2) If ^ > 1, 



OLH 



^{qL>b}>U{b)b-\ (25) 

where [/(•) is some slowly varying function. 

It takes several steps to prove Theorem |71 we start by defining and studying a related fictitious 
queueing system. 

C. Fictitious system 

The fictitious system consists of two queues, fed by the same input processes that feed the original 
system. In the fictitious system, let us call the queues fed by heavy and light traffic H and L respectively. 
The fictitious system operates under the following service discipline. 

Service for the fictitious system: The queue H receives service in every time slot. The queue L 
receives service at time t if and only if qi{t)°'^ > q^{t)"^" . 

Note that if L receives service and H is non-empty, two packets are served from the fictitious system. 
Also, H is just a discrete time G/D /\ queue, since it receives service at every time slot. We now show 
a simple result which asserts that the light queue in the original system is 'longer' than in the fictitious 
system. 
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Proposition 10: Suppose a given input sample path feeds the queues in both the original and the 
fictitious systems. Then, for all t, it holds that qi{t) < qiit)- In particular, for each 6 > 0, we have 

F{qL>b}>F{q^>b}. 

Proof: We will assume the contrary and arrive at a contradiction. Suppose qi{0) = 91,(0), and that for 
some time t > 0, qi{t) > qL{t)- Let r > be the first time when qiir) > qbiT)- It is then necessary 
that qj^{T — 1) = gi(T — 1), since no more than one packet is served from a queue in each slot. Next, 
qi{T — 1) = qiir — 1), and qiir) > qL{T) together imply that L received service at time r — 1, but 
L did not. This is possible only if g//(r — 1) < g^(r — 1), which is a contradiction, since H receives 
service in each slot. □ 

Next, we show that the distribution of q^^ satisfies the lower bounds in Equations (|2TI)-(|23]). Theorem|7] 
then follows, in light of Proposition [TOl 

Theorem 8: In the fictitious system, the distribution of qj^ is asymptotically lower bounded as follows. 

1) If ^ < 1> 

F{qi>h]>\HF[HR>^\ (26) 

F{qi>h]> XhfIhr> h(l + ^\\ {11) 

F{qi>b]>\HF[HR>r-/''-^ (28) 

Proof: Let us consider an instant t when the fictitious system is in steady-state. Since the heavy queue 
in the fictitious system receives service in each slot, the steady-state probability P {g^ > O} = A^ by 
Little's law. Therefore, we have the lower bound 

W{qi>b]>\HF{q^>b\qfj>^]. 

In the rest of the proof, we will lower bound the above conditional probability. 

Indeed, conditioned on qpj > 0, denote as before by B{t), the number of packets that belong to the 
head-of-line burst that still remain in H at time t. Similarly, denote by A{t) the number of packets 
from the head-of-line burst that have already been served by time t. Since H is served in every time 
slot, A{t) also denotes the number of time slots that the HoL burst has been in service at H. 

The reminder of our proof shows that qi{t) stochastically dominates a particular heavy-tailed random 
variable. Indeed, at the instant t, there are two possibilities: 



2) If ^ 



3) If ^ > 1, 
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(a) g£(t)"^ > B{t)°'", or 

(b) g^(t)°- < ^(t)°-, 

Let us take a closer look at case (b) in the following proposition. 
Proposition 11: Suppose that 

g^(t)"^ <B{t)"". 

Let a <the the instant before t that L last received service. Then, the current head-of-line burst arrived 
at H after the instant a. 
Proof: We have 

qfliaT" < qiiar- < q^itr- < B{tr" . 

The first inequality holds because L received service at a, the second inequality is true since L does 
not receive service between a and t, and the final inequality is from the hypothesis. 

We have shown that (?/^(cj) < B{t), and hence the HoL burst could not have arrived by the time slot 
a. U 

The above proposition implies that if case (b) holds, L has not received service ever since the HoL 
burst arrived at H. In particular, L has not received service for A{t) time slots, and it accumulates all 
arrivals that occur during the interval \t — A{t)^t]. Let us denote the number of arrivals to L during 

this interval as 

t 

Sa= Y. ^«- 

i=t-A{t) 

In this notation, our argument above implies that if case (b) holds, then qi{t) > S^. Putting this 
together with case (a), we can conclude that 

(?^(t)>min(5(tr«/'^-, S^). (29) 

Therefore, 

P {qiit) >b}> XhF {S(t)""/"^ >b, S^>bY (30) 

Recall now that in steady-state, B{t) is distributed as Hr, and A{t) is distributed as Ha- Therefore, 
the above bound can be written as 

F{gz > ^} > AnPli^r^"' > b, f]i« > 4 • (31) 
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Lemma [5] shows that 

Notice that the assumption i7(-) G 07^ is used in the proof of Lemma [5] 

Theorem [8] now follows from the above asymptotic relation and (|3TI ). D 

Proof of Theorem [^ The result follows from Theorem [8] and Proposition [TOl □ 

VL Tail Coefficient of qi 

In this section, we characterize the exact tail coefficient of the light queue distribution under max- 
weight-a scheduling. In particular, we show that the upper bound ( fT6l ) is tight for -^^ < 1 , and ( fT9b is 
tight for ^ > 1. 

Theorem 9: The tail coefficient of the steady-state queue length qi of the light queue is given by 
(i) Ch - 1 for ^ < 1, and 
(ii) 7 = ^(C^-l)forf^>L 

Proof: Consider first the case -^^ < 1. The lower order (Definition [3]) of qi can be upper bounded 
using (I2TI ) or (l22l ) as follows 

lim mr : : < iim mi ■ 



b->-oo log 6 b->oo log 6 



liminf-'°'^';^"^°'^C„ 

a->-oo log a 



The last step is from Proposition |5] The above equation shows that the tail coefficient of qi^ is at most 
Ch — 1- However, it is evident from (IT6l ) that the tail coefficient of qi is at least Ch — 1- Therefore, 
the tail coefficient of q^ equals Ch — 1 for ^ < 1. This proves case (i) of the theorem. 
Next, consider -^^ > 1. Using (1231 ). we can upper bound the lower order of qi as 

^. ., \og¥{qL>h} ^ ,. . , \ogF{HR>h^-l^»] 
lini mi < iim mi 

fe->oo log h fe-i-oo log h 

= ^liminf-^°g^^^^-"^=^(CH-l) (32) 

an a-^co log a an 

Equation (l32l ) shows that the tail coefficient of q^ is at most 7. However, it is evident from (fT9l ) that 

the tail coefficient of qi is at least 7. Therefore, the tail coefficient of qi equals 7 = ^{Cr — 1) for 

^ > 1. This proves case (ii) of the theorem. D 
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Fig. 2. The tail coefficient of ql under max-weight a scheduling, as a function of aL/ctn, for Ch ~ 2.5. 



In Figure |2l we show the tail coefficient of qi as a function of the ratio ai/ciH- We see that the 
tail coefficient is constant at the value Ch — 1 as ai/dH varies from to 1. Recall that ai/aH = 1 
corresponds to max-weight scheduling, while ai/ctH i corresponds to priority for H. Thus, the tail 
coefficient of q^ under max-weight scheduling is the same as the tail coefficient under priority for H, 
implying that the max-weight policy leads to the worst possible asymptotic behavior for the light queue 
among all non-idling policies. However, the tail coefficient of qi begins to improve in proportion to 
the ratio ai/aH in the regime where the light queue is given more importance. 

Remark 2: If the heavy-tailed input has infinite variance {Ch < 2), then it follows from Theorem |9] 
that the expected delay in the light queue is infinite under max-weight scheduling. Thus, lfT4l Proposition 
5] is a special case of the above theorem. 



VII. Log-Max-Weight Scheduling 

We showed in Theorem |9] that the light queue occupancy distribution is necessarily heavy-tailed with 
a finite tail coefficient, under max-weight-a scheduling. On the other hand, the priority for L policy 
which ensures the best possible asymptotic behavior for both queues, suffers from possible instability 
effects in more general queueing networks. 

In this section, we propose and analyze the log-max-weight (LMW) policy. We show that the light 
queue distribution is light-tailed under LMW scheduling, i.e., that F {q^ > b} decays exponentially fast 
in b. However, unlike the priority for L policy, LMW scheduling is throughput optimal even in more 
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general settings. For our simple system model, we define the LMW policy as follows: 
In each time slot t, the log-max-weight policy compares 

qL{t)%log{l + qH{t)), 

and serves one packet from the queue that wins the comparison. Ties are broken in favor of the light 
queue. 

The main idea in the LMW policy is to give preference to the light queue to a far greater extent than 
any max-weight-a policy. Specifically, for ai/dH > 1, the max-weight-a policy compares q^ to a 
power of qh that is smaller than 1 . On the other hand, LMW scheduling compares qL to a logarithmic 
function of qh, leading to a significant preference for the light queue. It turns out that this significant 
de-emphasis of the heavy queue with respect to the light queue is sufficient to ensure an exponential 
decay for the distribution of q^ in our setting. 

Furthermore, the LMW policy has another useful property when the heavy queue gets overwhelmingly 
large. Although the LMW policy significantly de-emphasizes the heavy queue, it does not ignore it, 
unlike the priority for L policy. That is, if the H queue occupancy gets overwhelmingly large compared 
to L, the LMW policy will serve the H queue. In contrast, the priority for L policy will ignore any 
build-up in H, as long as L is non-empty. This property turns out to be crucial in more complex 
queueing models, where throughput optimality is non-trivial to obtain. For example, when the queues 
have time-varying connectivity, the LMW policy will stabilize both queues for all rates within the rate 
region, whereas priority for L leads to instability effects in H. 

Our main result in this section shows that under the LMW policy, P {ql > b} decays exponentially 
fast in b, unlike under max-weight-a scheduling. 

Theorem 10: Under log-max-weight scheduling, q^ is light-tailed. Specifically, it holds that 

liminf--logP{<7L > 6} > mm{EL, Cr - 1), (33) 

fe— >-oo 



where El is the intrinsic exponent, given by (fT3] ). (fTSl ). 
Proof: Fix a small 5 > 0. We first write the equality 
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P{gL >b} = F{qL> b; log(l + qh) < 5b} 

' . ' 

(i) 

+ P{qL> b; (1 - 6)b > log(l + qh) > Sb} 

^ V ' 

+ P {gL > b; log(l + qh) > (1 - 6)b} (34) 

^ V ' 

(Hi) 

We will next upper bound each of the above three terms on the right, 
(i) P{(7l > b; log{l + Qh) < Sb} : Intuitively, this event corresponds to an overflow of the light 
queue, when the Ught queue is not 'exponentially large' in b, i.e., qn < exp(56) — 1. Suppose 
without loss of generality that this event happens at time 0. Denote by — r < the last instant 
when the heavy queue received service. Since H has not received service since — r, it is clear that 
log(l + qui-r)) < 5b. Thus, qii-r) < Sb. 

In the time interval [— r + 1, 0] the light queue receives service in each slot. In spite of receiving 
all the service, it grows from less than 6b to overflow at time 0. This implies that every time the 
event in (i) occurs, there necessarily exists —u < satisfying 

J2 {m-i)>{i-6)b. 

i=—u+l 

Therefore, 



Y^ {m-l)>{l-5)b\. 

-—u+l ) 



^{qi > b; log(l + qH) <db} <¥i3u>0 
Letting Su = J2i=-u+i ^(^)' ^^ above inequality can be written as 



' {ql > b; log(l + qn) <5b}<F{ sup (Su -u)>{l-5)b\. (35) 



The right hand side of ( 135] ) is precisely the probability of a single server queue fed by the process 
L(-) reaching the level (1 — 6)b. Standard large deviation bounds are known for such an event. 
Specifically, from HOl Lemma 1.5], we get 

liminf--logp|supS'„-u> {I - 6)b\ > inf uA^ (1 + ^^ 

6^oo b {u>0 J ">0 V ^ 

= inf i^A2(l + a) = (1-5)^L. (36) 

a>0 a 
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From (|35] ) and (l36l ). we see that for every e > and for large enough b, 

^{qi > b; log{l + qH) < 5b} < Kie-^^^-^^^^'~'\ (37) 

(iii) Let us deal with the term (iii) before (ii). This is the regime where the overflow of L occurs, along 
with H becoming exponentially large in b. We have 

P {ql > 6; log(l + qn) > (1 - S)b} = P {^l > b; qh > e^^"^)^ - l} 

< F[qL + qH>e^'-''''} 

We have shown earlier in the proof of Theorem |4] that for any non-idling policy, 

P{qL + qH> M} < K2M-^^'"-^-'^ 

for every e > and some K2 > 0. Therefore, 

P{qL>b; log{l + qH)>{l-5)b}<K2e^p{-{l-5)b{CH-l-e)), V e > 0. (38) 

(ii) Let us now deal with the second term, P {q^ > b; (1 — 6)b > log(l + qn) > Sb} . Let us call this 
event £2- Suppose this event occurs at time 0. Denote by — r < the last time during the current 
busy period that H received service, and define 

T] = log(l + qH{-T)). 

If H never received service during the current busy period, we take r to be equal to the last instant 
that the system was empty, and 77 = 0. We can deduce that rj < {1 — 6)b, because H receives 
no service in [— r + 1,0]. It is also clear that qii—'T) < V- Therefore, L grows from less than r/ 
to more than 6, in spite of receiving all the service in [— r + 1,0]. Using u and ^ as 'dummy' 
variables that represent the possible values taken by r and rj respectively, we can write 

P{^2} < ^{^i<{l-6)b,u>Q\Su-u>b-i; qH{-u) + qL{-u)>e^^ 
(l-8)b 



< Yl ^{^u>0\s^-u>b-^; qH(.-n)+qL{-u)>e'^] 
5=0 

{l-5)b 

< J] J]p{s„-n>6-e; qH{-u)+qL{-u)>e^] 



5=0 u>0 

where the last two steps are by the union bound. Notice now that for every n > 0, the event 
Su — u > b — ^ is independent of the value of qni—u) + qL{~u), since these are determined by 
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arrivals in disjoint intervals. Therefore, continuing from above 

il-S)b 
= Yl Y.^{Su-u>b-^}F[qH{-u) + qL{-n)>e^\ 
^=0 u>0 
(1-5)6 

5=0 u>0 
(1-5)6 

< Yl '^ie-(^^-^)('-^)K2e-(^--i-^)^ V e > 0. 



(39) 



(40) 



Equation ( [39l ) follows from (ITT] ). and (l40l ) is a classical large deviation bound that follows, for 
example, from lITOl Lemma 1.5]. Thus, for every e > 0, 



(1-5)6 



-[(CH-l~e)(+{E^-e){b-0] 



P{^2}< Yl '^I'^se 

Let us now distinguish two cases: 

- Ch — 1 > El : In this case, we can bound the above probability as 

where k > is some constant. 

- Ch — 1 < El : In this case, 

P{^2} < Ke-^C^^-i-^^^i-"^), Ve > 0. 

Let us now put together the bounds on terms (i), (ii) and (iii) into Equation (l34l ). 
1) If Ch-1> El, we get from ^, ^, and 



F{qL>b}<e-^^^-^'>'^^^-''> 

from which it is immediate that 

1 



Kl + K2e 



-((l-S)b{CH-l-E^)) ^^ 



liminf--logP{gj: > 6} > {l-6){EL-e). 

6— i-oo 



Since the above is true for each e and 6, we get 



liminf--logP{gL > 6} > El- 

6— i-oo 



2) If Ch -I < El, we get from ([37]), (EH), and dH, 



^^^-{(l-SWE.-C.+l))^^^^^ 



(41) 



(42) 



(43) 



(44) 



(45) 



(46) 
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from which it is immediate that 



liminf-^logPjgL >6} > {l-6){CH-l-e). 

b-^oo b 

Since the above is true for each e and S, we get 

lim inf -^ log P {ql > 6} > C// - 1. (47) 

fe^oo 

Theorem [10] now follows from (|45]l and (|47]l. D 

Thus, the light queue tail is upper bounded by an exponential term, whose rate of decay is given by 
the smaller of the intrinsic exponent E^, and Ch — 1- We remark that Theorem [TOl utilizes only the 
light-tailed nature of L(-), and the tail coefficient oi H{-). Specifically, we do not need to assume any 
regularity property such as H{-) G OTZ for the result to hold. However, if we assume that the tail of 
H{-) is regularly varying, we can obtain a matching lower bound to the upper bound in Theorem [TOl 
Theorem 11: Suppose that H{-) € 1Z{Ch)- Then, under LMW scheduling, the tail distribution of qi 
satisfies an LDP with rate function given by 

lim -- logP{<7L >h} = min(EL, Ch - !)• 
Proof: In light of Theorem [TOl it is enough to prove that 

limsup--logP{gL > 6} < min(£'L, C// - 1). 

Let us denote by q£ the queue length of the light queue, when it is given complete priority over 
H. Note that P < q^ > & f is a lower bound on the overflow probability under any policy, including 
LMW. Therefore, for aU 6 > 0, P{gL > h] > PJgSf^ > h\ . This impUes 



lim sup — logP{(7L > ^} < limsup— -logP<^ gj^^ > 6> = E'l, 

where the last step is from (fT4l ). 

Next, we can show, following the arguments in Proposition [10] and Theorem [8] that 

P{'ZL>6}>A^piif,j>e''-l; YlL{i)>h\. 
But arguing similarly to Lemma [S] we can show that 

Ha 



(48) 



i=l J 
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Fig. 3. The large deviation exponent for ql under LMW scheduling, as a function of A^. The light queue is fed by Poisson 
bursts, and Ch =2.5. 



Thus, 

^{qL>h]>^[HR>e''-l]. 

Next, since H{-) is regularly varying with tail coefficient Ch, Hr is also regularly varying with tail 

coefficient Cr - 1, so that Pji^ij > e^ - 1} = U{e^)e~^^^"~^\ Finally we can write 

1 1 r ~i loe;[/fe^) 

limsup--logP{gL >b} <limsup--logP<^iJ/j > e'' - 1 [ =Cg-l-limsup , • 

The final limit supremum is shown to be zero in Lemma |6l using a representation theorem for slowly 
varying functions. Thus, 

1 



limsup -^ logP {gL >b} <Ch -I- 

6— 5>oo b 



(49) 



Equations (1481 ) and (1491 ) imply the theorem. D 

Figure [3] shows the large deviation exponent given by Theorem [TT] as a function of A/,, for Ch = 2.5, 
and Poisson inputs feeding the light queue. There are two distinct regimes in the plot, corresponding 
to two fundamentally different modes of overflow. For relatively large values of A^, the exponent for 
the LMW policy equals El, the intrinsic exponent. In this regime, the light queue overflows entirely 
due to atypical behavior in the input process L(-). In other words, ql would have grown close to the 
level b even if the heavy queue was absent. This mode of overflow is more likely for larger values of 
Xl, which explains the diminishing exponent in this regime. 

The flat portion of the curve in Figure |3] corresponds to a second overflow mode. In this regime, the 
overflow of the light queue occurs due to extreme misbehavior on the part of the heavy-tailed input. 
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Specifically, the heavy queue becomes larger than e^ after receiving a very large burst. After this instant, 
the heavy queue hogs all the service, and the light queue gets starved until it gradually builds up to 
the level b. In this regime, the light queue input behaves typically, and plays no role in the overflow 
of L. That is, the exponent is independent of A^,, being equal to a constant Ch — 1- The exponent is 
decided entirely by the 'burstiness' of the heavy-tailed traffic, which is reflected in the tail coefficient. 

VIII. Concluding Remarks 

We considered a system of parallel queues fed by a mix of heavy-tailed and light-tailed traffic, 
and served by a single server. We studied the asymptotic behavior of the queue size distributions 
under various scheduhng policies. We showed that the occupancy distribution of the heavy queue is 
asymptotically insensitive to the scheduling policy used, and inevitably heavy-tailed. In contrast, the 
light queue occupancy distribution can be heavy-tailed or light-tailed depending on the scheduling 
policy. 

The major contribution of the paper is in the derivation of an exact asymptotic characterization of the 
light queue occupancy distribution, under max-weight-a scheduling. We showed that the light queue 
distribution is heavy-tailed with a finite tail coefficient under max-weight-a scheduling, for any values 
of the scheduling parameters. However, the tail coefficient can be improved by choosing the scheduling 
parameters to favor the light queue. We also observed that 'plain' max-weight scheduling leads to the 
worst possible asymptotic behavior of the light queue distribution, among all non-idhng policies. 

Another important contribution of the paper is the log-max-weight pohcy, and the corresponding 
asymptotic analysis. We showed that the light queue occupancy distribution is light-tailed under LMW 
scheduling, and explicitly derived an exponentially decaying upper bound on the tail of the light queue 
distribution. Additionally, the LMW policy also has the desirable property of being throughput optimal 
in a general queueing network. 

Although we study a very simple queueing network in this paper, we believe that the insights obtained 
from this study are valuable in much more general settings. For instance, in a general queueing network 
with a mix of light-tailed and heavy-tailed traffic flows, we expect that the celebrated max-weight policy 
has the tendency to 'infect' competing light-tailed flows with heavy-tailed asymptotics. A similar effect 
was also noted in |14|, in the context of expected delay. 

We also believe that the LMW policy occupies a unique 'sweet spot' in the context of scheduling 
light-tailed traffic in the presence of heavy-tailed traffic. This is because the LMW policy de-emphasizes 
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the heavy-tailed flow sufficiently to maintain good light queue asymptotics, while also ensuring network- 
wide stabiUty. 

For future work, we propose the extension of the results in this paper to more general single-hop and 
multi-hop network models. Even in the context of parallel queues, the incorporation of time-varying 
channel models presents an interesting direction. 

Appendix 

Lemma 3: P {Hr > m, HA>n} = F {Hr > m + n} 
Proof: Using © and dD, 

k>ml>n ^ +-' 



oo 



^ V^ V^ ^ 1-^+ = P} 

Z^ Z^ E\H,] 

k>m p=k+n 

= ^f{HR = k + n} 

k>m 

= P {Hr >m + n}. 

D 
Lemma 4: Let N S ZTZ be a non-negative integer valued random variable. Let Xi,i > 1 be i.i.d. 
non-negative light-tailed random variables, with mean fi, independent of N. Define 

N 

Sn = / ^Xj. 

i=l 

Then, 

P {^Tv > 6} ~ P {iV > b/n} . 

Proof: For notational ease, we will prove the result for /i = 1, although the result and proof technique 
are applicable for any // > 0. First, for a fixed (5 > 0, we have 



P{57v>H = F{SN>b; N <b{l-6)}+F{SN>b- N>b{l-5)} 

< P{5l6(i_5)j >&}+P{Af>Kl-'5)}. (50) 

Next, we write a lower bound: 



30 



P{5w>6} > F{SN>b; N >b{l+5)} 

= ¥{N>b{l + 6)}-F{SN <b; N>b{l + 6)} 

> P {AT > 6(1 + 6)} - P {5rb(i+5)l < b} • (51) 

Since the Xi have a well defined moment generating function, their sample average satisfies an 
exponential concentration inequality around the mean. Specifically, we can show using the Chernoff 
bound that there exist constants k, t] such that 



Ke 



Thus, it follows that 



as 6 — )- oo. Similarly, 



P{%i_5)j >6}=o(P{iV>6}) (52) 

P{5Lfc(i+,)j <b}=oiF{N>b}). (53) 



Next, getting back to (1501) . 



y nSN^bl P{%i-^)J>&} , , F{N>b{l-d)} 

lim sup ^ , ^^ < lim sup ^ , ^^ — h hm sup ztttz 7^; • 

b-^oo P{N>b} - b^J P{iV>6} fc^oo P{N>h} 

The first term on the right hand side is zero in view of (l52l ). so that for all 6, we have 

F{SN>b}^^. F{N>b{l-d)} 

nm sup -— T -T- < lim sup -—, ^ . 

b^J F{N>b} - fc^oo P{iV>6} 

Taking the limit as 5 J, 0, 

F{SN>b}^.. ^. F{N>b{l-S)} 
lim sup -rr-r— -^ < lim lim sup ^ ,,, --; = 1 (54) 

b^^^ F {N > b} - Sio b^o. P{iV>6} 



The final limit is unity, by the definition of the class Z7?.. Similarly, we can show using (I5TI ). (1531 ) and 
the intermediate-regular variation of the tail of N that 

Iiminf'^y^->?>1. (55) 

b^oo F{N>b} - ^ ' 

Equations (l54l ) and (1551 ) imply the result. D 

The above lemma can be proved under more general assumptions than stated here, see |fT9Tl . 
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Lemma 5: If H{-) G OTZ, we have 



i=l 



(56) 



F{Hn>b+^} f^ = l, 
^ P{if/j>6"-/""} f^>l. 
Proof: In this proof, let us take Al = 1 for notational simplicity, although the same proof technique 
works without this assumption. Denote Sn = J2i=i L{i). We first get an upper bound. For every 5 > Q, 
we have 

¥[HR>r-'^"; SH,>b] = 

P [Hr > 6"-/"«; Sh, >b; Ha< b{l -6)] + 

<r{SH.>b; HA<b{l-5)} + 

<r{SH.>b; HA<b{l-6)} + 

<P{%i-5)j >b} + 

In (|58] ) we have utilized Lemma |3l and in Equation ( [59l ). we have used the independence of Ha and 
-L(-). Next, let us derive a lower bound. 

■ [Hr > 6--/--; Sh, >b; Ha> 6(1 + <5)} = 
'{Hr > 6"-/"-; Sh. <b; Ha> 6(1 + <5)} > 
'{5h. <6; HA>b{l + 6)}> 



' {Hr > 6"-/"«; Sh, >b; Ha> 6(1 - 5)} 
'{i^ij>6"-/"-; HA>b{l-6)] (57) 

'{^ij>6"^/"«+6(l-5)} (58) 

■ (Hr > 6"^/"" + 6(1 - 5)| . (59) 



P{^/1>6"-/"-; 5j^, >&} > 

{Fr>6"-/"-; /7^>6(1 + 5)} - 

{i^R>6°-/°-; /7a>6(1 + <5)} - 

P{i^ij>6"^/"«+6(l + (5)| - 



'{^1 



[6(1+5)1 



< 



6}. 



(60) 



Equation (l60l ) uses Lemma|3] Now, observe that the terms P 1 5*1 ^n _5)j > 6} in ( [59l ) and P 1 5'rb(i_|_5)i < 6} 
in (l60b decay exponentially fast as 6 — )■ oo, for any 6 > 0. This is because L(-) is light-tailed, and 
their sample average satisfies an exponential concentration inequality around the mean (unity). More 
precisely, a Chernoff bound can be used to show that 



and 



'{Sibii-5)\ >b}=o (f^Hr > 6"-/"- + 6}) , 
'{%i+5)l < 6} =o(f[Hr > fe"-/"- +6}) . 



(61) 



(62) 



Case (i): f^ < 1- Using (|59]l, we write 

F{HR>b"-/"-; SH.>b} ^^, P{5Lb(i-5)j >6} Pj//^ > 6°-/°« + 6(1 - 5)} 
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The first limit supremum on the right is zero in view of (|6TI ). Since -^^ < 1, we can write 

Thus, 

P{gfi>b"^/"^ SH.>b} ^^. P{ff^>b(l-(^)} 

limsup — ^^ — < limiimsup — — = 1. (63) 

b^oo r{HR>b} - Sio b^^^ F{HR>b} 

The final limit is unity, because according to Lemma[Tl H{-) € OTZ implies Hr € £'7^. Since STZ C X7^, 

the final limit in ( [63] ) is unity, by the definition of intermediate-regular variation (Definition |2]). 

Along similar lines, we can use ( [60l ). ( [62b . and the fact that Hr £ ZTZ to show that 

fe^oo P {il^R > 6} - 510 b^oo P {iJij >b} ^ ^ 

Equations ( [63] ) and ( [64] ) imply that 

P {i^/j > 6°"/°" ; SH,>b}^F {Hr > b} , 

which implies Lemma [5] for -^^ < 1, and Al = 1. 

Case (ii): ^ = 1. Similar to the previous case. Here, we get 

P {Hr > b; Sh, > 6} - P {Hr > 26} . 

Case (Hi): ^^ > 1. 

For the upper bound, we have from ( [59] ) and ([52]) . 

F{HR>b"-/""; SH^>b} ^^. P {Fr >&"-/"«+ 6(1 - <5)} ^^ 
lim sup — ^^ p ; — ^ < iim sup — ^^ p -, — ^ < 1 . 



Similarly, for the lower bound, we have from (1601 ) and 

,. . r{HR>b"-/""; SH,>b} ^^. . P{Fr>6°-/°-+6(1 + 5)} 

lini mi — -^ — ^ lini mi — -^ — 

b-,oo F{HR>b°"^/°"'} ~ b^oo Pjii'ij > 6"^/"^} ' 

F{Hr > 6"^/°«(l + (5)| 

> liminf ^ ,\ yS > 0. 

~ fe->oo P{i7ij > 6"^/"«} 

Thus, 

,. . F{HR>b^-/^"; SH.>b} ^^. ,. . P{Fr>6"-/"-(1 + (^)} 

limmi p ; — ^ > iimiimini p -, — ^ = 1, 

6-^oo Pji^ij > 6°^/°«} SIO b^oD PjifR > 6"i/"«} 

where the last limit is unity due to the intermediate -regular variation of Hr. Therefore, we can conclude 
for ^ > 1 that 

an 

F^Hr > 6"^/"«; Sh^ > b\ ~ PJi^/j > 6"^/"«| . 
Lemma \5\ is now proved. D 
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Lemma 6: For any slowly varying function f7(-), 



a->-oo log a 



Proof: We use the representation theorem for slowly varying functions derived in HI. For every slowly 
varying function U{-), there exists a S > such that for all x > i?, the function can be written as 



Uix) = exp I v{x) + I dy 

V Jb y 



where v{x) converges to a finite constant, and C,{x) — )• as x ^ oo. Therefore, 

, log^(a) ,. via) + Pb '-f^y , Ps'-f^y 
nm — = lim = lim — - — , 

a— i>oo log a a—KXi log a a— i-oo log a 

where the last step is because v{a) converges to a constant. Next, given any e > 0, choose C(e) such 
that |C(a)! < e, V a > C(e). Then, we have 



lim 

a— >oo 



< hm — — < hm ^— = e. 



log a a->-oo log a a->-oo log O 

Since the above is true for every e, the result follows. □ 
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